Emergence of communities in weighted networks 
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Topology and weights are closely related in weighted complex networks and this is reflected in 
their modular structure. We present a simple network model where the weights are generated 
dynamically and they shape the developing topology. By tuning a model parameter governing the 
importance of weights, the resulting networks undergo a gradual structural transition from a module 
free topology to one with communities. The model also reproduces many features of large social 
networks, including the "weak links" property. 
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Network theory has undergone a remarkable develop- 
ment over the last decade and has contributed signifi- 
cantly to our understanding of complex systems, ranging 
from genetic transcriptions to the Internet and human 
societies [TJ Many complex networks are structured 
in terms of modules, or communities, which are groups of 
nodes characterized by having more internal than exter- 
nal connections between them. Such a mesoscopic net- 
work structure is expected to play a concrete functional 
role. Consequently, it is an important problem to under- 
stand how the communities emerge during the growth 
of the network. Apart from these issues of topologi- 
cal nature, it is important to realize that many complex 
networks are weighted, i.e., the interaction between two 
nodes is characterized not only by the existence of a link 
but a link with a varying weight assigned to it. There 
are a number of examples, like traffic, metabolic or cor- 
relation based networks, which provide ample evidence 
that the weights have to be included in their analysis. 
In many cases the weights affect significantly the proper- 
ties or function of these networks, e.g., disease spreading 
[3] , synchronisation dynamics of oscillators [4 , and motif 
statistics [5 J . It is natural to expect that weights have an 
influence on the formation of communities, which is the 
very issue of our study. 

Earlier, coupled weight-topology dynamics have been 
used successfully in transport networks modeling [6], 
which, however, does not lead to community structure. 
We show that there are mechanisms, by which weights 
play a crucial role in community formation. While we 
believe this to be quite a general paradigm for commu- 
nity formation, we have chosen to explore it within the 
realm of social systems where large datasets have en- 
abled looking into both the coupling of network topology 
and interaction strengths and properties of communities 
[7J HJ E] • Understanding how the underlying microscopic 
mechanisms translate into mesoscopic communities and 
macroscopic social systems is a key problem in its own 
right and one that is accessible within the scope of sta- 




FIG. 1: The model algorithm, (a): a weighted local search 
starts from i and proceeds first to j and then to k, which is a 
neighbor of i. (b): the local search from i ends to k' , which is 
not a neighbor of i. In this case link w ik i is established with 
probability pa- (c): node i creates a random link to random 
node I with probability p r . In cases a) and b) the weights of 
involved links are increased by S. 



tistical physics. 

Large scale social networks are known to satisfy the 
weak links hypothesis |10] with the implication that weak 
links keep the network connected whereas strong links are 
mostly associated with communities [24 . This weight- 
topology coupling results from the microscopic mecha- 
nisms that govern the evolution of social networks. Net- 
work sociology identifies (a) cyclic closure and (b) focal 
closure as the two fundamental mechanisms of tie for- 
mation Cyclic closure refers to forming ties with 
one's network neighbors - "friends of friends". Experi- 
mental evidence indicates that the probability of cyclic 
closure decreases roughly exponentially as a function of 
the geodesic distance [T2]. Focal closure, in contrast, 
refers to forming ties independently of the geodesic dis- 
tance and is attributed to forming social ties through 
shared activities (hobbies etc.) jTT] . These two mecha- 
nisms form the basis of the topological rules of our model. 
As for the weights, we have chosen a simple scenario in 
which new ties are created preferably through strong ties, 
every interaction making them even stronger. 

We consider a fixed size network of N nodes, where 
links are created in two ways: First, in time interval At 
each node having at least one neighbor starts a weighted 
local search for new acquaintances, Fig. [lja,b). More 
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FIG. 2: (Color online) Snowball samples 13] of networks with 
(a) 5 = 0, (b) 5 = 0.1, (c) 8 = 0.5, and (d) 5 = 1. Link col- 
ors change from green (weak links) to yellow and red (strong 
links) . 



specifically, node i chooses one of its neighbors, node j, 
with probability Wij/si, where Wij is the weight of the 



link connecting i and j and Sj = y\ toy the strength 
of i. If the chosen node j has other neighbors apart 
from i, it chooses one of them, say fc, with probability 



W jk /(Sj 



- w. 



j). Therefore, the search favors strong links. 



If there is no link between i and k, it is established with 
probability p&At such that uiik = wq. If the link exists, 
its weight is increased by an amount 6. In both cases, w^j 
and Wjk are also increased by 8. We call this attachment 
mechanism local attachment (LA) , and it corresponds to 
a special case, triadic closure, of the above mentioned 
cyclic closure mechanism. Second, if a node has no links, 
or otherwise with probability p r At 7 it creates a link of 
weight wq to a random node, Fig. [ijc) . This mechanism 
corresponds to establishing a new interaction outside the 
immediate neighborhood of the chosen node, analogu- 
ously to focal closure, and we call it global attachment 
(GA). Finally, any node can be removed with probabil- 
ity paAt by the node deletion (ND) mechanism, in which 
case also the adjacent links are removed. The removed 
node gets replaced by a new node, such that the size of 
the system remains fixed at N. ND is the only mecha- 
nism that decreases the number of links in the model and 
leads to exponentially distributed lifetime r for nodes and 
t w for links, averages of which are 



(r) = (paAt)- 1 
(t w ) = (2p d Aty\ 



(1) 



FIG. 3: iifc=4 (□) and (n s ) (A) as a function of 8. Results 
are averaged over 10 realizations of N — 5 x 10 4 networks. 
Error bars are measured standard deviations. 



The model was studied by simulations, which were 
started from a seed network of N nodes without any links. 
Subsequent changes due to LA and GA mechanisms were 
updated using parallel update, followed by the ND step. 
We set At = 1 and wq = 1 without loss of generality. The 
time scale is fixed by the death rate pd, which was set to 
10~ 3 corresponding to (r) = 10 3 time units. The random 
link probability was set to p r — 5 x 1CP 4 , corresponding 
to adding on average one random link for each node dur- 
ing average node lifetime. The network algorithm was 
found to reach steady state in ~ 10 — 20 average node 
lifetimes, after which all measured characteristics were 
found to be stationary. The following results were ob- 
tained by running the simulations for 25 x 10 3 time steps, 
i.e., 25 average node lifetimes. 

The weights enter the model dynamics through pa- 
rameter 6. In order to compare networks resulting from 
different values of 5, we have chosen to keep the average 
degree (fc) sa 10 constant. Thus, the number of links is 
roughly equal in all networks and changes in the structure 
result solely from restructuring of the links. Keeping (k) 
constant requires adjusting p& for each 5, which can be 
done easily as their dependence is a smooth monotonous 
function [5S]. When 6 — we obtain unweighted net- 
works that resemble those obtained by certain older mod- 
els [2J[T5] without apparent community structure. How- 
ever, increasing S results in denser networks in which 
communities clearly appear as seen in Fig. [2] This can be 
attributed to the effect of 5 on the LA mechanism. The 
higher the value of 5, the more trapped the local searches 
become, i.e., they repeatedly follow the same links, si- 
multaneously increasing the weights of these links and 
the associated triangles, which in turn further enhances 
the trapping effect. Thus, in the transient phase of the 
model dynamics prior to the stationary state, any emerg- 
ing triangle starts to rapidly accumulate weight, acting 
as a nucleus for community formation. 

We consider first the topological structure of the model 
networks and study the communities using the fc-cliquc 
percolation method [8j[T6]. This avoids the problems of 
modularity-based methods [TTJ [TH1 HH], which may not 
properly resolve communities if their size distribution is 
broad. The LA mechanism, which is mainly responsible 
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for introducing new links, generates at least one trian- 
gle per added link. Therefore, we focus on 4-cliques, the 
smallest non-trivial cliques beyond triangles. Figure [3] 
shows the relative largest community size Rk=4 and av- 
erage community size excluding largest community (n s ) 
for 8 G [0, 1]. When 8 = communities arc mainly very 
small, (n s ) ~ 6 and the largest ones contain ~ 50 nodes. 
Increasing 5 changes the network structure significantly. 
At first, the network becomes homogeneous in the sense 
that the 4-cliques percolate through most of the system, 
but as 8 becomes larger the nodes begin to condensate in 
tighter communities. This can be seen as the increase in 
(n s ) and simultaneous decrease in Rk=4- When 8 > 0.2 
the network contains communities whose (n s ) ~ 20 while 
the largest ones consist of several hundred nodes. The 
probability for communities of size larger than 50 to oc- 
cur is several orders of magnitude higher for 8 > 0.2 than 
for 6 = 0. Similar results were obtained for /c-clique per- 
colation with k = 3 and k = 5. The community size 
distribution was found to be stable after ~ 10 average 
node lifetimes. This can be understood in the large 8 
limit by considering the change in the size N s of commu- 
nity s. When 8 is large the LA process mostly follows 
the strong within-community links and we can assume 
that community merging is rare. We can now estimate 
the change in N s as 

^ = -p dNs +p d N^=0, (2) 

where the first term on the r.h.s. follows from the fact 
that each of the Ns nodes in s is deleted with probability 
Pd, and the second reflects the fact that of the all p^N 
deleted nodes, a fraction of N s /N will form an initial, 
random link to community s, thus becoming a member 
of that community at subsequent time steps by the LA 
process. This shows that once the algorithm has reached 
a state in which most local searches remaining in the ini- 
tial community is valid, the community size distribution 
remains constant. Figures [2] and [3] indicate that 6 > 0.5 
can already be regarded as "large" , because the commu- 
nities are tight and increasing 8 does not change them 
significantly. 

Next, we consider the effect of 8 on weight-topology 
correlations and study the network structure with link 
percolation. Weak links hypothesis implies that links 
within communities are strong whereas links between 
them are weak. Therefore, if the network has such a 
structure, it should disintegrate faster when links are re- 
moved in ascending than in descending order of weight, 
as observed in [7] . Here we remove links from the network 
in both orders while monitoring the network properties 
as a function of fraction of removed links /, which acts 
as a control parameter. We have measured the relative 
size of the giant component Rlcc serving as an order pa- 
rameter, the 'normalized susceptibility' s = X) n sS 2 /N, 
where n s is the number of components of size s and the 
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FIG. 4: (Color online) Rlcc an d s for link percolation. Left: 
weak links removed first, right: strong links removed first. 
Insets: Average clustering. Results are averaged over 10 re- 
alizations of N = 5 x 10 4 networks. Values of 8 are (□), 
1 x 1(T 3 (*), 1 x 1CT 2 (>), 0.1 (A), 0.5 (v), and 1 (o). 

sum is taken over all but the largest component, and 
the average clustering coefficient (c) [5D]. Figure [4] shows 
link percolation results for networks for 8 G [0,1]. For 
small values of 8 it appears that there is no community 
structure compatible with the weak links hypothesis, as 
the giant component is destroyed at the same point for 
both removal orders and s remains very small. However, 
when 8 > 0.1 the networks start to break faster when 
weak links are removed first and s developes a finite sig- 
nature of divergence, indicating that the weak links serve 
as bridges connecting communities. This is also corrobo- 
rated by the the rapid decrease in (c) when strong links 
are removed first (see inset). This effect was also veri- 
fied similarly to Refs. [JJ[3T] using the topological overlap 
measure, which was found to increase clearly as a func- 
tion of link weight when S > 0.1 (not shown). 

Finally, as our model is inspired by mechanisms of so- 
cial network formation, we investigate whether it repro- 
duces also other properties of social networks. The pic- 
ture emerging from analysis of a number of large data 
bases [7J [STJ [S3] shows several common features: i) 
degree distributions are skewed, ii) high-degree nodes 
are often connected to other high-degree nodes (assor- 
tative mixing), Hi) the average clustering coefficient (c) 
is high compared to random networks, and iv) the av- 
erage shortest path lengths are small (the small-world 
property) . Figure [5] presents the basic distributions for 
network quantities with 8 G [0,1]. The degree distribu- 
tion was found to be almost independent of 8. It has a 
quite fast decaying tail with the maximum degree ~ 10 2 
for networks with (k) « 10 and can be fitted reasonably 
with an exponential distribution. The networks show 
high clustering behaving almost as c(k) ~ 1/k. The 
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FIG. 5: (Color online) The model meets criteria (i)-(iv) for 
social networks (see text) when 5 € [0,1]. (a) Degree dis- 
tribution, (b) clustering, (c) average nearest neighbor degree 
k nn , and (d) sampled network diameter as a function of net- 
work size N. Results for (a), (b), and (c) are averaged over 
10 realizations of N = 5 x 10 4 networks. The correspond- 
ing distributions for the mobile phone call network studied in 
Ref. [7] are marked by + (note that this network has average 
degree (k) — 3.14), the exponential fit in (a) is shown as a 
solid line, other symbols are as in Fig. [4] 



model networks are also found to be assortative and the 
diameter grows as log TV, Fig. [5][c,d). 

In this paper we have introduced a simple weighted 
network model in which the weights are an essential 
part of the microscopic mechanisms. They establish 
a coupling between network structure and interaction 
strengths: addition of a new link depends on the ex- 
isting weights, and once a new link is added the weights 
that led to its formation are strengthened. Communities 
will emerge only if this strenghtening is large enough, 
i.e., if nodes favor sufficiently their strong connections in 
the process of establishing new ones. Our study support 
the notion that communities result from initial struc- 
tural fluctuations, which become amplified by repeated 
application of the microscopic processes. In addition 
to fulfilling the standard topological properties of so- 
cial networks, the model networks exhibit a coupling be- 
tween network topology and interaction strengths, which 
is compatible with the weak links hypothesis. Most im- 
portantly, this model provides a starting point for un- 
derstanding the formation of communities in weighted 
networks. 
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